- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
- Banking & Finance (0.96)
- Health & Medicine (0.94)
- North America > United States > District of Columbia > Washington (0.04)
- North America > United States > California > Santa Clara County > Mountain View (0.04)
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
- (2 more...)
Sycophancy as compositions of Atomic Psychometric Traits
Jain, Shreyans, Yost, Alexandra, Abdullah, Amirali
Sycophancy is a key behavioral risk in LLMs, yet it is often treated as an isolated failure mode arising from a single causal mechanism. We instead propose modeling it as a geometric and causal composition of psychometric traits such as emotionality, openness, and agreeableness, analogous to factor decomposition in psychometrics. Using Contrastive Activation Addition (CAA), we map activation directions to these factors and study how different combinations may give rise to sycophancy (e.g., high extraversion combined with low conscientiousness). This perspective allows for interpretable, compositional vector-based interventions such as addition, subtraction, and projection, which may be used to mitigate safety-critical behaviors in LLMs.
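The vector arithmetic the abstract describes can be sketched roughly as follows. This is our illustration, not the authors' released code: the function names and the mean-difference construction of CAA-style directions are assumptions, with per-trait directions taken from layer activations on contrastive prompt pairs.

```python
import numpy as np

def trait_direction(pos_acts, neg_acts):
    """CAA-style trait direction: mean activation difference between
    trait-positive and trait-negative prompts at a chosen layer.
    Inputs are (n_prompts, hidden_dim) arrays."""
    return pos_acts.mean(axis=0) - neg_acts.mean(axis=0)

def compose(directions, weights):
    """Weighted combination of per-trait directions, e.g.
    +1.0 * extraversion - 0.7 * conscientiousness, normalized."""
    v = sum(w * d for w, d in zip(weights, directions))
    return v / np.linalg.norm(v)

def project_out(h, v):
    """Remove the component of hidden state h along direction v,
    one of the projection-style interventions mentioned above."""
    v = v / np.linalg.norm(v)
    return h - (h @ v) * v
```

A steering intervention would then add (or subtract) the composed vector to the residual stream at the chosen layer, or project it out entirely.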
We thank all our reviewers for their feedback! We will respond to (R2, R3) separately from R1 due to their different concerns.

We thank R2 and R3 for their vote of confidence and for giving this work a high score of 9 and 8, respectively. It means a lot to us to see our ideas accepted by our peers at NeurIPS, who also believe that our "work opens many new […]". We experimented with setting all weights to a single fixed value, e.g. […]. However, if we then nudge that value by a small amount, to say 0.6, the network fails completely at the […]. In fact, the best performing values were outside of this training set. We will cite and discuss this work in our revised paper. NeurIPS 2019 will discuss similar themes, and we are excited to see more ideas in this direction from both communities.

We agree with R3 that scaling up is the next step. […] (Stanley, 2009) to scale WANN architectures to scales able to compete on benchmarks such as ImageNet and Atari. We wish to take the time to conduct this investigation thoroughly, and plan to report the findings in a follow-up paper on WANNs. We would also like to thank R3 for the other minor suggestions; we will clarify the labels and information. In the spirit of this extreme experiment, the algorithm used was purposefully kept simple. Our original intention was to focus only on continuous-control RL experiments, and we decided to run MNIST "for fun". We could have confined the paper to only RL experiments (most RL papers don't run MNIST […]). Finally, we do believe there is a connection to the neuroscience field, e.g. "What Artificial Neural Networks can Learn from Animal Brains" (Zador, 2019), whose central theme is that "The first […]".
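The single-shared-weight experiment discussed in the response can be sketched as follows. This tiny network and its topology are illustrative assumptions, not the paper's code; the point is only that every active connection shares one scalar weight, whose value is then swept.

```python
import numpy as np

def forward(x, masks, w):
    """Evaluate a fixed topology in which every active connection
    shares the single weight value w (WANN-style evaluation)."""
    h = x
    for mask in masks:            # mask: 0/1 connectivity per layer
        h = np.tanh((mask * w) @ h)
    return h

# A hand-picked toy topology and input, for illustration only.
masks = [np.array([[1., 0., 1., 0.],
                   [0., 1., 0., 1.],
                   [1., 1., 0., 0.]]),
         np.array([[1., 0., 1.]])]
x = np.array([0.5, -0.2, 0.1, 0.8])

# Sweeping the shared weight shows how strongly behaviour can
# depend on this one scalar, as the response describes.
outputs = {w: forward(x, masks, w) for w in (-2.0, -1.0, 0.6, 1.0, 2.0)}
```

Even a small nudge of the shared value changes the network's output, consistent with the observation that performance is sensitive to this single parameter.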
Selective Matching Losses -- Not All Scores Are Created Equal
Shamir, Gil I., Warmuth, Manfred K.
Learning systems match predicted scores to observations over some domain. Often it is critical to produce accurate predictions in some subset (or region) of the domain, yet less important to predict accurately in other regions. We construct selective matching loss functions by designing increasing link functions over score domains. A matching loss is an integral over the link. The link defines loss sensitivity as a function of the score, emphasizing high-slope, high-sensitivity regions over flat ones. Loss asymmetry drives a model, resolving its underspecification so that it predicts better in high-sensitivity regions, where accuracy matters more, and distinguishes between high- and low-importance regions. A large variety of selective scalar losses can be designed with scaled and shifted Sigmoid and hyperbolic-sine links. Their properties, however, do not extend to the multi-class case: applying them per dimension lacks the ranking sensitivity that assigns importance according to class score ranking. Utilizing composite Softmax functions, we develop a framework for multidimensional selective losses. We overcome limitations of the standard Softmax function, which is good for classification but not for distinguishing between adjacent scores. Selective losses have a substantial advantage over traditional losses in applications with more important score regions, including dwell-time prediction, retrieval, ranking with pointwise, contrastive pairwise, or listwise losses, distillation problems, and fine-tuning alignment of Large Language Models (LLMs).
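The scalar construction can be sketched numerically. For an increasing link f, the matching loss is L(s_hat, s) = ∫ from s to s_hat of (f(z) − f(s)) dz, so its sensitivity tracks the slope of the link. The function names and the trapezoidal integration below are ours; the paper defines the loss analytically.

```python
import numpy as np

def sigmoid_link(z, a=4.0, b=0.0):
    """Scaled and shifted Sigmoid link: a sets the slope, b the
    location of the high-sensitivity region."""
    return 1.0 / (1.0 + np.exp(-a * (z - b)))

def matching_loss(link, s_hat, s, n=1001):
    """Matching loss of an increasing link, integrated numerically
    by the trapezoid rule over [s, s_hat]:
        L(s_hat, s) = integral of (link(z) - link(s)) dz."""
    z = np.linspace(s, s_hat, n)
    vals = link(z) - link(s)
    dz = (s_hat - s) / (n - 1)
    return float(np.sum((vals[:-1] + vals[1:]) / 2.0) * dz)
```

For the same score error, the loss near the steep center of the link (around b) exceeds the loss in its flat tails, which is exactly the selectivity described above.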
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
- Europe > Russia (0.04)
- Asia > Russia (0.04)
MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks
Veuthey, Jaime Raldua, Majid, Zainab Ali, Hariharan, Suhas, Haimes, Jacob
As Large Language Models (LLMs) advance, their potential for widespread societal impact grows with them. Rigorous LLM evaluation is therefore both a technical necessity and a social imperative. While numerous evaluation benchmarks have been developed, there remains a critical gap in meta-evaluation: effectively assessing the quality of the benchmarks themselves. We propose MEQA, a framework for the meta-evaluation of question-and-answer (QA) benchmarks, to provide standardized assessments and quantifiable scores and to enable meaningful intra-benchmark comparisons. We demonstrate this approach on cybersecurity benchmarks, using human and LLM evaluators, highlighting the benchmarks' strengths and weaknesses. Our choice of test domain is motivated by AI models' dual nature as powerful defensive tools and potential security threats.
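The abstract promises standardized, quantifiable scores but does not spell out the scoring scheme. The following is a purely hypothetical sketch of such an aggregation; the criterion names, weights, and averaging rule are all our assumptions, not MEQA's actual design.

```python
from statistics import mean

# Hypothetical quality criteria; MEQA's actual criteria are not
# listed in the abstract.
CRITERIA = ("question_clarity", "answer_verifiability", "domain_coverage")

def meta_score(ratings, weights=None):
    """Aggregate 0-1 ratings per criterion, collected from several
    evaluators (human or LLM), into one benchmark quality score
    in [0, 1] via a weighted mean."""
    weights = weights or {c: 1.0 for c in CRITERIA}
    total = sum(weights.values())
    return sum(weights[c] * mean(ratings[c]) for c in CRITERIA) / total
```

A scheme of this shape yields a single comparable number per benchmark while preserving the per-criterion breakdown for diagnosing strengths and weaknesses.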
Exploring the Potential of Large Language Models to Simulate Personality
Molchanova, Maria, Mikhailova, Anna, Korzanova, Anna, Ostyakova, Lidiia, Dolidze, Alexandra
With the advancement of large language models (LLMs), the focus in Conversational AI has shifted from merely generating coherent and relevant responses to tackling more complex challenges, such as personalizing dialogue systems. In an effort to enhance user engagement, chatbots are often designed to mimic human behaviour, responding within a defined emotional spectrum and aligning with a set of values. In this paper, we aim to simulate personality traits according to the Big Five model using LLMs. Our research shows that generating personality-related texts remains a challenging task for these models. We therefore present a dataset of generated texts with predefined Big Five characteristics and provide an analytical framework for testing LLMs on simulating personality traits.
- Asia > Russia (0.15)
- North America > United States > New York (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- (2 more...)
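One way to condition a model on a Big Five profile, as the personality-simulation abstract above describes, is simple prompt construction. This template is a hypothetical illustration; the paper's actual prompts and generation setup are not given in the abstract.

```python
# The five traits of the Big Five (OCEAN) model.
BIG_FIVE = ("openness", "conscientiousness", "extraversion",
            "agreeableness", "neuroticism")

def personality_prompt(levels, topic):
    """Build a generation prompt from a trait profile, where
    levels maps each trait to 'high' or 'low'."""
    profile = ", ".join(f"{levels[t]} {t}" for t in BIG_FIVE)
    return (f"Write a short text about {topic} as a person with "
            f"the following Big Five profile: {profile}.")
```

Texts generated from such prompts can then be scored by a personality classifier to test whether the requested trait levels actually show up in the output.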